Red Storm Capability Computing Queuing Policy

Authors

  • James A. Ang
  • Robert A. Ballance
  • Lee Ann Fisk
  • Jeanette R. Johnston
  • Kevin T. Pedretti
Abstract

Red Storm will be the first Tri-Lab [Sandia National Laboratories (SNL), Los Alamos National Laboratory (LANL), and Lawrence Livermore National Laboratory (LLNL)] U.S. Department of Energy/National Nuclear Security Administration Advanced Simulation and Computing (ASC) platform to be managed under an explicit capability computing policy directive. Instead of allocating nodes among SNL:LANL:LLNL in a 2:1:1 ratio, Red Storm will use PBS-Pro (the commercial version of the Portable Batch System) to manage priorities among the labs so that, in the long run, their node-hours of usage follow the 2:1:1 ratio. The basic queuing policy design is described, along with extensions to handle switching between classified and unclassified operation, use by ASC university partners, priority access, and other special cases.
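
The idea of steering long-run node-hours toward a 2:1:1 ratio through priorities, rather than through a fixed node split, can be illustrated with a minimal Python sketch. This is not the PBS-Pro algorithm or the policy described in the paper. The function names, the deficit-based ranking, and the alphabetical tie-break are all assumptions made for illustration only.

```python
# Target long-run shares of node-hours, taken from the 2:1:1 ratio in the abstract.
TARGET_SHARES = {"SNL": 2, "LANL": 1, "LLNL": 1}


def usage_deficit(usage_node_hours, targets=TARGET_SHARES):
    """Return, per lab, its target fraction minus its actual fraction of node-hours.

    A positive value means the lab has used less than its long-run share.
    (Hypothetical helper, not part of PBS-Pro.)
    """
    total_target = sum(targets.values())
    total_usage = sum(usage_node_hours.get(lab, 0.0) for lab in targets)
    deficits = {}
    for lab, weight in targets.items():
        target_frac = weight / total_target
        if total_usage > 0:
            actual_frac = usage_node_hours.get(lab, 0.0) / total_usage
        else:
            actual_frac = 0.0  # no usage recorded yet
        deficits[lab] = target_frac - actual_frac
    return deficits


def next_lab(usage_node_hours, targets=TARGET_SHARES):
    """Pick the lab furthest below its target share.

    Ties are broken by lab name (an arbitrary, assumed rule).
    """
    deficits = usage_deficit(usage_node_hours, targets)
    return max(sorted(deficits), key=lambda lab: deficits[lab])


if __name__ == "__main__":
    # Equal usage so far: SNL is below its 50% target, so it is favored next.
    print(next_lab({"SNL": 100.0, "LANL": 100.0, "LLNL": 100.0}))
```

Under this assumed scheme, a lab that has consumed more than its share simply drops in priority until the others catch up, so no nodes sit idle waiting for a particular lab's jobs.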

Similar resources

Investigating Real Power Usage on Red Storm

High Performance Computing (HPC) has historically been the impetus for new technologies. However, in dealing with power related issues HPC has lagged behind. Power has recently been recognized as one of the major obstacles to fielding a Peta-Flop class system. In this paper we will discuss power related topics by leveraging what is currently a rare capability of examining real power usage at a ...

Towards a Specification for Measuring Red Storm Reliability, Availability, and Serviceability (RAS)

The absence of agreed definitions and metrics for supercomputer RAS obscures meaningful discussion of the issues involved, hinders their solution, and increases total system cost. Seeking to foster a common basis for communication about supercomputer RAS, [1] proposed a general system state model, definitions, and measurements based on the SEMI-E10 specification [2] used in the semiconductor ma...

A Comparison of Three MPI Implementations for Red Storm

Cray Red Storm is a new distributed memory massively parallel computing platform designed to scale to tens of thousands of nodes. Red Storm has a custom network designed around the Cray SeaStar network interface and router. In this paper, we present an evaluation of three different MPI implementations for Red Storm: the vendor-supported MPICH2 implementation, and two other implementations based...

The Markov chain model of RED active queuing management algorithm

A novel Markov-chain model of RED is proposed by the authors in the article. The paper gives a detailed description of the model and some preliminary results for exponential and self-similar traffic. The Markov chain analysis results are verified by a simulation model. 1. The RED (Random Early Drop) active queuing management algorithm. The majority of today's network technologies are based on tr...

Evaluation Model Queuing Task Scheduling Based on Hybrid Architecture Cloud Systems

Applications based on a cloud computing platform usually need to use a number of computing and storage resources to complete computing tasks, so the fault-tolerant capability of the system has become increasingly important. Aiming to solve this problem, an evaluation model of task scheduling is proposed based on cloud system (TSCS). TSCS can effectively model and simulate complex cloud...

Publication date: 2005